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§1  Introduction. 

Let  F(x)  be  a  distribution  function  on  the  real  line  with  finite  second  order 
moment.  Then  the  quantile  function  F1-1^)  is  square  integrable  as  a  function  of  u  £ 
(0,1).  Hence  1<  *(u)  can  be  expanded  into  a  Fourier  series  with  respect  to  a  complete 
orthonormal  system  of  L2{0, 1):  the  space  of  square  integrable  functions  on  (0,1).  This 
idea  was  used  by  Sugiura  (1962,1964)  in  connection  with  approximation  of  expected  values 
of  order  statistics.  In  this  paper  we  first  note  that  various  complete  orthonormal  systems 
of  L2[ 0,1]  can  be  obtained  by  quantile  transformation  of  complete  orthonormal  systems 
of  L2([<)  where.  L2(F)  denotes  the  space  of  square  integrable  functions  with  repect  to  a 
distribution  F,  for  example  the  normal  distribution.  If  we  take  the  normal  distribution  as 
F,  then  the  Fourier  coefficients  can  be  thought  as  components  with  respect  to  an  infinite 
dimensional  coordinate  system  whose  origin  is  the  normal  distribution  itself.  This  gives  a 
very  crisp  idea  of  distances  and  directions  between  various  distributions.  Furthermore  the 
Fourier  series  expansion  of  quantile  function  closely  parallels  the  Cornish-Fisher  expansion 
(Fisher  and  Cornish,  I960)  and  gives  a  general  method  of  approximating  quantiles  of  one 
distribution  based  on  quantiles  of  another  distribution.  The  Cornish-Fisher  expansion  is 
asymptotic  in  its  nature,  whereas  our  expansion  leads  to  a  convergent  series.  These  ideas 
are  discussed  in  Section  2. 


When  F  is  taken  to  be  an  empirical  distribution  function  Fn,  the  Fourier  coefficients 
of  I' n  provide  a  decomposition  of  the  sum  of  squares  constituting  the  Shapiro-Francia 
statistic  (1972).  This  is  analogous  to  the  components  of  the  Cramer-von  Mises  statistic 
discussed  by  Durbin  and  Knott  (1972)  and  Durbin,  Knott,  and  Taylor  (1975).  See  also 
Durbin  (1973).  As  in  the  case  of  components  of  the  Cramer-von  Mises  statistic,  various  tests 
can  be  constructed  from  the  components  depending  on  the  possible  alternative  hypotheses, 


whereas  the  Shapiro-Francia  test  serves  as  a  test  which  detects  an  overall  departure  from 
the  null  hypothesis.  The  Shapiro-Francia  statistic  and  its  components  can  be  applied  to 
nonnormal  distributions  as  well.  See  Section  3  for  these  developments. 


In  the  study  of  asymptotic  distributions  of  the  Shapiro-Francia  test  statistic,  de 
Wet  and  Venter  (1972)  implicitly  used  our  components  as  eigenfunctions  of  the  asymptotic 
covariance  function  of  the  quantile  process.  In  the  normal  case,  the  eigenfunctions  are  the 
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Hermite  polynomials.  This  has  been  noticed  in  other  contexts  too.  See  Stephens  (1975). 
Despite  this  close  analogy  to  the  components  of  the  Cramer-von  Mises  statistic,  components 
of  the  Shapiro-Francia  statistic  have  not  been  taken  up  as  useful  tools  in  themselves.  We 
have  to  add,  though,  that  components  which  are  closely  related  to  ours  have  been  discussed 
in  the  framework  of  weighted  Cramer-von  Mises  tests  with  various  weight  functions.  See  de 
Wet  and  Venter  (1973a, b),  Pettitt  (1977,1978).  This  connection  is  based  on  the  asymptotic 
equivalence  between  the  Shapiro-Francia  statistic  and  certain  weighted  Cramer-von  Mises 
statistic.  See  Gregory  (1977a, b).  We  give  a  brief  discussion  on  this  point  in  Section  4. 

§2  Orthogonal  expansion  of  quantile  function. 

2.1  Mallow’s  d2-distance  and  an  angle  between  two  distributions. 

Let  r2  be  the  set  of  (right  continuous)  distribution  functions  on  the  real  line  having 
finite  second  order  moments,  P2  =  {  F  :  f  x2dF  <  oo  }.  Let  F~{  be  the  quantile  function 
which  we  take  to  be  right  continuous  for  definiteness:  F~i(u)  =  sup{z  :  F[ x)  <  v  }.  For 
F  £  P2  we  have  x'2dF  —  /01(l'1~l(w))2<lu  <  oo.  Hence  by  taking  the  quantile  function 
r2  is  mapped  into  L2[0,1],  the  space  of  square  integrable  functions  on  (0,  1).  Mallow’s 
^2-distance  between  Fx  and  F2  in  P2  is  defined  by  the  usual  inner  product  <,>  of  L2[0, 1], 
namely 

(2.1)  d2(Fl,F2)  =  {  [  (FTl(u)-F^(u))2du}1/2. 

J  0 

A  nice  discussion  on  this  is  given  in  Section  8  of  Bickel  and  Freedman  (1981). 

Proposition  2.1*  Let  Fi,F2  £  IV  Then  d2(F1,  F2)  =  0  if  and  only  if  F\  —  F2. 

Proof  is  easy  and  omitted.  From  now  on  we  are  mainly  interested  in  the  cosine  of 
angle  between  two  distibutions  obtained  from  d2-distance.  Let  U  be  uniformly  distributed 
on  (0, 1).  We  define 


(2.2) 


p{F1,F2)  =  Cor(FT1{U),F^\U)). 
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Note  that  p  is  invariant  with  respect  to  location  and  scale  changes  in  I<\  and  F2.  For 
example  if  Fl{x)  =  F0((x  -  a)/b)  =u  (b  >  0)  then  =  o  +  bF^u).  Hence 


AFuFi)—  Cor(a  +  bF0  1{U},F2  1{U))=  Cor (F0  \U),F2  1  (£/))=  p{F0,F2).  Similarly  for 


F2.  Therefore  p  can  be  interpreted  as  the  cosine  of  angle  between  location  scale  families. 


Also  note  that  p  is  nonnegative  because  Fl1,F21  are  both  monotone  increasing. 


Proposition  2.2.  p(FuF2)  =  1  if  and  only  if  F,  is  a  location  scale  transform  of  F2. 

This  follows  from  the  fact  that  if  Ft,F2  have  mean  0  and  variance  1,  then 
dz{FuF2Y  =  2(1  -p{FuF2)). 


2.2  Orthonormal  basis  with  respect  to  a  specific  distribution. 


By  taking  the  quantile  functions  we  mapped  T2  into  L2[0,1].  Now  we  take  an 
appropriate  complete  orthonormal  system  in  L2[ 0, 1].  The  important  point  is  that  this  can 
be  done  with  respect  to  a  specific  distribution.  First  we  prove 


Proposition  2.3.  Let  F(e  P2)  be  a  continuous  distribution  function  which  is  supported 
(i.e.,  strictly  increasing)  on  an  interval.  Then  }°L0  is  a  complete  orthonormal  system  of 

"*■  ^  '  -f  <t>(x)  dl'  (x)  <  00  }  if  and  only  if  {ipi  o  F~'  1  }“10  is  a  complete  orthonormal 
system  of  L2{ 0,1]. 

Proof:  Let  be  a  complete  orthonormal  system  of  L2{F).  Then 

J  ipi[F  1{u))t/jJ{F  1{u))du  —  f  'ipi{x)'tp:j(x)dF(x)  =  6ij. 

Hence  -tpi  0  f  l’s  are  orthonormal.  To  prove  completeness  suppose  /  £  L2 [0,1]  and  < 
/,  ?/>*■  o  F~x  >=  0,  i  =  0,  l, . . . .  By  assumpion  F(F~l(u))  =  u.  This  implies  that  /  o  F  G 
L2(F)  and  0  =  <  f.^oF^  >- f f(F{x))^(x)dF ,  i  =  0, 1, ....  Hence  by  completeness 
of  }i=o>  /  o  F  =  0  in  L2(F)  or  f(F(x))2dF(x)  =  0.  Hence  f(u)2du  =  0.  This 
shows  that  {  V’i  0  F  1  }t=0  is  complete  in  L2[ 0, 1].  Converse  can  be  proved  similarly.  | 

We  can  now  express  G~l  in  a  Fourier  series  using  the  orthonormal  basis  {ij){  o 
F  W  Since  we  are  working  with  a  location  scale  family,  we  take  F  with  mean 
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0  and  variance  1,  namely  /  xdF  =  0,  /  x2dF  =  1.  In  this  case  by  Grarn-Schimidt 
orthonormalization  we  can  take  an  orthonormal  system  {  ipi  }^*L0  of  L2(F)  with  =  1, 
ipl  =  x}  since  these  two  are  orthonormal.  In  the  sequel  we  work  with  this  choice  of 
orthonormal  basis. 

Theorem  2.1,  Let  F  (£  I'2)  be  a  continuous  distribution  function  supported  on  an 
interval  and  with  mean  0  and  variance  1.  Let  a  comP^e^e  orthonormal  system 

of  L2(F)  =  {(j>  :  /  cfPdF  <  oo  }  with  'tpQ  =  1,  'ipi  =  x .  Then  for  any  G  £  T  2 

00 

(2.3)  G-1(n)  =  Tj  ai'll)AF~l[u))y 

i=0 

where  =<  G”1, 0  P”1  >  and  convergence  on  the  right  hand  side  is  in  L2[ 0,1]. 
Furthermore  a{  =0,  i  =  2, 3, . . . ,  2/  and  only  if  G  is  a  location  scale  transform  of  F. 

Proof :  We  need  only  to  prove  the  last  statement.  Let  fio  —  f  xdG.  Note 

«orz:::<G"1,l>=  [  G~'i{u)du  —  pbGl 
Jo 

ai  =<  G-X,F-X  >  =  <  G~x  —  not G~l  —  iiq  >l/2  p[G,F), 

OO 

<  G_1  -  P'Gt  G'1  —  fig  >=  y;  af . 

i=l 


Hence 

00 

(2.4)  p(<7,F)  =  a1/(X;«1?)1/2. 

*=1 

Therefore  />(G,  P)  =  1  if  and  only  if  a*  =  0,  i  =  2, 3, -  | 

If  we  normalize  G  as  well  to  have  mean  0  and  variance  1,  we  have  a0  =  fiG  =  0 
and  ]Ci==i  ai  =  I-  Hence  we  can  write 

00 

(2.5)  p(G,F)  =  a1=( 

»=2 

This  shows  that  if  we  represent  a  location  scale  family  of  distributions  by  the  normal¬ 
ized  disribution  G  with  mean  0  and  variance  1,  then  it  can  be  uniquely  represented  by 
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components  ai}  i  —  2,3, ...  with  respect  to  (infinite  dimensional)  coordinate  system  {^-o 

F~X  2-  We  call  ai’s  components  of  G  with  respect  to  F.  By  Theorem  2.1  the  origin  in 
this  coordinate  system  is  F  itself. 

1  he  components  ait  i  =  2, 3, . . . ,  can  be  used  in  two  ways.  First  it  gives  informa¬ 
tion  on  how  much  and  in  what  direction  G  deviates  from  F.  See  Section  2.3  below.  Second 

use  is  for  an  approximation  of  quantiles.  Let  ya,  xa  be  a-quan tiles  of  G  and  F,  respectively. 
Then  (2.3)  implies 

oo 

(2*6) 

1=0 

Hence  we  can  approximae  a-quantile  of  G  by  (a  function  of)  a-quantilc  of  F.  This  is 
analogous  to  Cornish-Fisher  expansion.  See  Fisher  and  Cornish  (1960).  Cornish-Fisher 
expansion  is  asymptotic  in  its  nature,  whereas  (2.6)  is  applicable  for  any  distribution  with 
finite  variance.  Concerning  the  convergence  of  (2.6)  the  following  distributional  result  is 

immediate  from  Theorem  2.1.  Since  ^-convergence  implies  convergence  in  distribution  we 
have 


Thoerem  2.2. 


Let  X ,  Y  be  distributed  according  to  F  and  G  respectively. 
Then  Xn  — ►  Y  m  distribution  as  n  — >  oo. 


Let  Xn  = 


furthermore  for  standard  distributions  and  standard  orthonormal  systems  (2.6) 
converges  pointwise.  This  is  an  obvious  advantage  of  (2.6)  over  Cornish-Fisher  expansion. 
More  precisely  we  have 


Theorem  2.3.  Let  G~\u)  be  piecewise  continuously  differentiable.  If  F ,  { ipi  are 

either  (i)  normal  distribution  and  the  Hermite  polynomials,  or  (ii)  Gamma  distribution  and 
the  Laguerre  polynomils,  or  (in)  Beta  distribution  and  the  Jacobi  polynomials,  properly 
normalized,  then  the  right  hand  side  of  (2.6)  converges  pointwise  at  every  continuity  point 
°f  G~l.  At  a  discontinuity  point  it  converges  to  (G-:1(m)  +  G~l{u  —  0))/2. 

Ihcsc  are  standard  results  in  the  theory  of  orthogonal  polynomials.  See  Chapter 
4  of  Lebedev  (1965)  or  Chapter  9  of  Szego  (1959).  Various  refinements  of  Theorem  2.3  can 
be  found  in  these  and  other  books  on  special  functions  and  orthogonal  polynomials. 
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2.3  Some  examples. 

We  present  components  of  several  distributions  with  respect  to  normal,  exponen¬ 
tial,  and  uniform  distributions.  Here  we  consider  distributions  for  which  G~l(u)  (or  jP"1(xt)) 
can  be  explicitly  written  and  hence  the  components  can  be  computed  rather  easily.  More 
extensive  tables  of  components  are  found  in  Takemura  (1983). 


unif 

exp 

logistic  weibull[2/3) 

weibull{  2) 

beta( 2,  2) 

wrt  normal 

wrt  normal 

wrt  normal  wrt  exp 

wrt  exp 

wrt  unif 

1 

.9772 

.9032 

.9959  .9692 

.9565 

.9959 

2 

.4212(3.71) 

.2423(3.22) 

-.2391(32.8) 

3 

-.1995(11.7) 

.0818(.09) 

.0902(.25)  — ,0404(.53) 

.1196(16.0) 

.0851(11.3) 

4 

— .one(.oi) 

.0151(.l5) 

-.0747(9.39) 

5 

.0669(1.79) 

— .0041(.00) 

— .0045(.00)  — .0076(.06) 

.0523(6.17) 

.0264(2.72) 

6 

.0044(.03) 

-.0392(4.36) 

7 

— .0258(.31) 

— .0028(.01) 

.0308(3.25) 

.0120(.95) 

8 

.0020(.00) 

-.0251(2.51) 

9 

.0106(.06) 

.0209(2.00) 

.0066(.41) 

10 

-.0177(1.63) 

11 

— .0046(.01) 

.0153(1.35) 

.0041(.2l) 

There  are  six  cases  listed  here.  Each  case  has  two  columns:  (i)  a^,  (ii)  100  X 
X^jfc+l  ai  /  2  ai  >  which  is  the  remaining  percentage  out  of  l  —  p{G,  F )2  when  a2, . . . ,  o,k 

are  taken  into  account.  The  six  cases  are  (i)  uniform  distribution  with  respect  to  normal, 
(ii)  exponential  distribution  with  respect  to  normal,  (iii)  logistic  distribution  with  respect 
to  normal,  (iv)  Weibull  distribution  with  parameter  .66  with  respect  to  exponential,  (v) 
Weibull  distribution  with  parameter  2  with  respect  to  exponential,  (vi)  Beta  distribution 
with  parameters  (2,2)  with  respect  to  uniform,  respectively.  Orthonormal  bases  used  are 
the  Ilerinite,  Laguerre,  and  Legendre  polynomials  for  normal,  exponential,  and  uniform 
distributions  respectively.  Details  of  computational  procedures  are  found  in  Appendix. 


§3  Components  of  the  Shapiro-Francia  statistic. 

In  the  previous  section  distributions  were  thought  as  population  distributions.  In 
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this  section  we  let  G  =  Gn  be  an  empirical  distribution  function  obtained  from  order 
statistics  X^  ^  •••  5s  X(ny  We  show  that  the  components  of  Gn  give  a  decomposition 
of  the  Shapiro-Francia  statistic:  W'  =  (£Xwmt)2/(E(JQ  -  Xf  •  ^m?)  where  m{  = 
E(X{i)). 

Note  that  Gn^(u)  =  X (,-j  for  (z  —  1)/ n  <  tz  <  z'/zz.  Hence  we  have 


(3.1) 


<  G"1,*-1  > 


r*/r 


SXW  If  ,W  ^ 
1=1  ^  (*-!)/« 


F~l{i/n) 


n  [F  (1 


((i-t)/n) 


xrfF(x) 


1  n 
-  X) 


t=i 


where  zn,:  is  the  mean  of  the  distribution  which  is  obtained  by  truncating  F  from  (z _ 

l)/n-quantile  to  z'/n-quantile  of  F  and  renormalizing  by  n.  Clearly  is  close  to  zn,-. 
Therefore  p(Gn,F)2  is  essentially  the  same  as  the  Shapiro-Francia  statistic  (1972).  See 
Shapiro  and  Wilk  (1965)  too.  Actually  p(Gn,F)2  seems  simpler  than  the  Shapiro-Francia 
statistic  because  in  (3.1)  is  generally  easier  to  compute  than  the  expected  value  of  z'-th 
order  statistic.  For  the  normal  case  see  Section  3.1  below.  Furthermore  it  is  intuitively 
clear  that  p{Gn,F )  -►  p(G,F )  as  zz  -+  oo  and  by  Proposition  2.2  the  test  based  on  p(Gn,F) 
is  consistent.  This  can  be  proved  by  simplifying  and  slightly  modifying  Sarkadi’s  proof 
(1975)  of  consistency  of  the  Shapiro-Francia  test. 


Although  ai  =  p(Gn,F)  gives  a  simple  omnibus  test  we  want  to  propose  a  larger 
class  of  tests  which  contains  ai  as  a  special  case.  As  discussed  in  Section  2.2  we  have  the 
relation  a\  =  1  —  ai  •  1°  v*ew  °f  fh"ls  relation  we  consider  tests  based  on  the  whole 

sequence  (02,03, ••  *)  rather  than  based  on  a2  alone.  Since  the  components  of  Gn  tend 
to  be  close  to  the  components  of  the  population  distribution  G,  we  can  easily  construct 
a  test  which  is  efficient  against  a  particular  alternative  G.  More  specifically  let  G  have 
components  a£,o§, ....  Now  using  the  idea  of  regression  let  T  =  E?L2.at-a?.  The  test 
with  the  rejection  region  T  >  c  clearly  has  high  power  against  the  alternative  G.  As  we 
can  see  from  the  examples  of  Section  2.3  components  generally  converge  to  zero  fairly  fast. 
Hence  in  1  =  Ei=2  a»'°?  ar)d  Ei=2°f  important  terms  are  the  first  few  terms.  It  is 
therefore  essential  to  be  able  to  interpret  low  order  components. 
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Now  we  apply  these  ideas  to  tests  of  specific  distributions. 


3.1  Tests  of  normality. 

Let  cj)(x )  =  be  the  standard  normal  density  and  {H ^(x)  }^=0  ^he  Hermite 

polynomials  given  in  (At)  of  Appendix.  With  respect  to  the  orthonormal  basis  {  Hic($~1(u))/\/k\  } 
components  are  given  by  =<  G~x,  H ^  o  >  / sy/kl  where  s 2  =  ][])”==  —  X)2  jn. 

Now 

n  (i/n) 

sVk\ak  =  T,  X(i)  /  Hk{x)<j>{x)dx 

•'♦“*((<- 1)/«) 

71 

—  ^  X fc) [Hfc — 1  (^(i — l)/n)^(^r(i — l)/n)  Ih — 1  (^i/n)0(-^i/n)3> 

i=l 

where  Za  =  <^~{{a)  is  a-quantile  of  the  standard  normal  distribution.  We  see  that  a^s  are 
very  simple  to  calculate.  In  particular  a\  =  p(Gn)( I>)  =  5^?=!  i)/w)“^(^i/n))/5 

corresponds  to  the  Shapiro-Francia  statistic. 

In  order  to  interpret  low  order  terms  let  us  look  at  them  as  regression  coefficients 
in  the  following  setting.  Consider  G~[  o  <b.  Plotting  this  function  corresponds  to  plotting 
the  observations  on  probability  paper,  where  the  normal  scores  are  taken  as  z-axis.  We 
give  the  weight  <j>{x)  to  the  z-axis  and  consider  the  following  weighted  least  square  problem: 

/oo  n 

(G'“1(<J>(x))  -  ^2  bkHk(x)/\/rk'.)2<j>{x)dx. 

.  -°°  fc=0 

Namely  we  approximate  G~l  o  $  by  a  polynomial  in  the  sense  of  weighted  least  squares. 
Because  of  the  orthogonality  of  Ilcrmite  polynomials,  the  minimizing  values  of  6o>...,fen 
are  equal  to  the  components  no, ... ,  an.  Note  that  H^x)  —  x2  —  l  and  H$(x)  =  —  3x. 

From  the  forms  of  these  polynomials  it  is  clear  that  a i  roughly  corresponds  to  skewness 
and  <23  roughly  corresponds  to  kurtosis.  Positive  a 2  indicates  skewness  toward  the  right 
and  positive  a%  indicates  a  heavier  tail  than  the  normal  distribution.  Another  way  of  seeing 
this  is  the  following.  Consider  s\/2a2 .  It  can  be  approximated  by  (l/n)  ]T) X^{m2  —  1) 
where  mi  =  E(X (q).  This  corresponds  to  (l/ro)]CP^(i)  “  ^(%))  —  —  Xi).  Similar 

consideration  applies  to  higher  order  components.  Examples  in  Section  2.3  comfirm  the 
above  interpretations. 


3.  Components  of  the  Shapiro-Francia  statistic. 
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For  symmetric  alternatives,  T  =  £~=1  4^/ £~=  0  a*k+l  =  l-af/E^o <4+1 
is  of  considerable  interest.  Actually  T  =  1  -  />(GS,  $)2  where  C3  is  such  that  p{Gn,  Ga)  = 
max{  />(Cn,  G)  |  G  is  symmetric  }.  The  maximizing  Ga  can  be  given  explicitly  as 


(3'2)  G'a1(«)  =  i(G'-1(«)-G'-1(l-tt)). 

By  symmetry  we  have  <  G~\H2ko <f>~i  >  =  0,  <  G7\H2k+1  >  =  <  G~l,H2k+1o 

>=a2k+1.  Note  that  s2p(Gn,  G's)2=  ££L0  a2fc+i  •  By  Cauchy-Schwarz  it  is  straightfor¬ 
ward  to  show  that  this  6%  maximizes  p{Gn,G)  where  G  is  symmetric.  Furthermore  1  - 
p{Ga,$>)  =1  —  a2/ £a_q  a2k+\  as  claimed  above.  The  denominator  ££L0ci2fc+1  ls  the 
variance  of  C3  which  can  be  calculated  as  follows.  Note  that  C3  is  the  empirical  distribution 
function  of  (X(1) -X(n))/2,  (X(2)  -X(n_1))/2, . . .  [X{n) -X(1))/2.  Therefore  £)£0  °ifc+i  = 
]Ci=iPQt)  ~  -^(n—i+i))2 /^n.  Thus  T  can  be  computed  easily. 


We  now  turn  to  the  asymptotic  null  distribution  of  each  component.  In  the  case 
ol  normality  we  have  the  following  nice  result: 


Theorem  3.1.  Under  the  null  hypothesis  of  normality  {^s/nail , . . . ,  y/naik)  ,2  <  i\  < 
...  <  ik  are  asymptotically  independently  distributed  according  to  the  normal  distribution 
with  mean  zero  and  variance  1  /(i,  +  1),  j  =  1, . . . ,  k,  respectively. 

1  his  can  be  proved  using  results  concerning  the  asymptotic  normality  of  linear 
combinations  of  order  statistics.  For  example  the  conditions  of  Theorem  1  of  Shorack 
(1972)  are  straightforward  to  check.  See  Example  lb  of  Shorack  (1972)  in  particlar. 
Asymptotic  independence  of  components  follows  from  the  formula  for  the  asymptotic 
covariance  in  Corollary  4  of  Chernoff,  Castwirth,  and  Johns  (1967).  Similarly  under  mild 
regularity  conditions  on  G  it  is  straghtforward  to  show  that  nl/2T  —  n1/2  £“  aia 9  has 
an  asymptotic  normal  distribution  with  mean  zero  and  variance 


i 

(min(s,  t)  -  st)J{s)J{t)/[ct>{<b-\s))<t>{$-ltt))\dsdt 


OO 


=  X>°)2/(*' + 


1), 


where  {a?}  are  components  of  G  and  J[u)  =  G~l{u)  -  a°$~l(u).  See  Theorem  3  of 
Chernoff,  Castwirth,  and  Johns  (1967),  Corollary  4.1  of  Stigler  (1969),  and  Theorem  1  of 
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Shorack  (1972)  for  appropriate  regularity  conditions.  As  for  the  test  against  symmetric 
alternatives  :  T  =  1  —  af  /  ]CfcLo  a2A;+i >  us*nS  results  from  de  Wet  and  Venter  (1972, 
1973a)  it  can  be  shown  that  the  distribution  of  n[T  —  fin)  approaches  the  distribution  of 
(1/2)  ~~  l)/(&  +  1)  where  fin}  s  are  suitable  constants  and  Yi,  I2,  •  •  •  >  are  indepen¬ 

dent  standard  normal  random  variables.  This  is  intuitively  clear  from  Theorem  3.1. 

3.2  Tests  of  exponentiality. 

Let  f(x)  =  e“x,  /'(z)  =  1  ~  e— x  and  {  L/c(a:)  }kL0  be  the  Laguerre  polynomials 
given  in  (Al)  of  Appendix.  In  this  case  =  —  log(l  —  u)  and 

/  Lk{x)f(x)  =  L*k[- log(l  -  (*'  -  l)/n)]  -  L*k[-  log(l  -  i/n)\, 

J  F~'((i-L)/n) 

where  L£(z)  ==  (— 1  )ri~‘{(dri~1  /dxn~~1)(xne~x).  Similar  considerations  as  in  the  normal  case 
indicate  that  positive  a 2  corresponds  to  a  heavier  tail  toward  the  right  than  the  exponential 
distribution.  This  is  confirmed  by  examples  in  Section  2.3.  Individual  components  and  the 
test  of  the  form  T  =.  aiai  can  t>e  USC(^  as  *m  lhc  normal  case. 

The  asymptotic  null  distribution  is  not  as  nice  as  in  the  normal  case.  Again  using 
results  of  ChcrnolT,  Gastwirth,  and  Johns  (1967)  and  others  we  have 

Theorem  3.2.  Under  the  null  hypothesis  of  exponentiality,  (\/nailf . . . ,  y/na{k),  2  < 
i  1  <  ...  <  ik  are  asymptotically  normally  distributed  with  mean  zero  and  covariance 

r  CO 

(3.3)  =  /  L**{x)L;*{x)e-*dx, 

Jo 

where  {i\)L**(z)  =  /*  U{y)dy  -  /0°°  e~x  /Qx  Li(y)dydx. 

Li*[x)  and  a can  be  explicitly  evaluated  using  (At)  in  Appendix. 

3.3  Tests  of  uniformity. 

Here  the  results  are  similar  to  those  for  the  exponential  distribution.  Let  {  Pk[%} 
be  the  Legendre  polynomials  given  in  (Al).  Then  {Pk{(2u  —  l)y/2k  +  1  }£L0  forms  a  com¬ 
plete  orthonormal  system  on  [0,1].  f(/*tynPk( 2w  —  1  )du  can  be  explicitly  evaluated  using 


4.  Discussion. 
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(Al).  Similar  interpretations  for  low-order  components  can  be  given  as  in  the  normal 
and  exponential  cases.  Under  the  null  hypothesis  n1/2ot’s  are  asymptotically  normally 
distributed  with  mean  zero  and  covariance 


where  P- ( x )/ s/li  +  1  fQ  P»( 2u  —  1  )du  —  /0  fQ  Pt( 2u  —  1  )dudx.  This  can  be  explicitly 
evaluated  using  (Al). 

§4  Discussion. 

Here  we  discuss  relations  and  relative  advantages  of  our  approach  to  other  tech¬ 
niques  especially  the  components  of  Cramcr-von  Mises  statistics.  Our  approach  is  based 
on  the  empirical  quantile  function  whereas  the  latter  is  based  on  the  empirical  distribution 
function. 

One  difficulty  with  techniques  based  on  empirical  distribution  function  is  that 
it  is  not  very  natural  to  estimate  parameters  in  that  setting.  When  quantile  function  is 
used  instead,  estimation  of  location  and  scale  parameters  are  taken  care  of  by  virtue  of 
invariance,  although  actually  this  reduces  to  estimating  mean  and  variance  by  their  sample 
quantities.  In  fact  the  analysis  becomes  quite  more  complicated  and  individual  cases  have 
to  be  treated  separately  when  various  parameters  are  estimated.  See  for  example  Table  1 
of  Durbin,  Knott,  and  Taylor  (1975). 

The  reason  that  individual  cases  require  separate  treatments  is  that  by  “components” 
Durbin  et  al.  mean  components  which  are  asymptotically  independently  distributed  from 
each  other.  In  our  case  this  corresponds  to  obtaining  principal  components  of  the  (infinite 
dimensional)  asymptotic  covariance  matrices  given  in  (3.3)  and  (3.4).  Although  this  might 
be  a  logically  natural  thing  to  do  on  the  ground  that  (i)  components  are  uniquely  deter¬ 
mined,  (ii)  asymptotic  distribution  is  understood  better  in  this  way,  it  might  not  be  very 
useful  from  practical  viewpoint.  When  components  change  depending  on  what  parameter 
is  estimated,  it  becomes  harder  to  interpret  them.  The  usefulness  of  the  definition  of  our 
components  is  in  Theorem  2.1  which  is  derived  on  the  basis  of  population  distributions 
rather  than  sampling  properties  and  the  interpretations  of  individual  components  seem  to 
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be  clearer.  On  the  other  hand  our  components  do  not  possess  the  property  of  asymptotic 
independence  in  general.  The  case  of  normal  null  hypothesis  is  a  fortunate  exception. 


A  formal  correspondence  between  [F~x  (u)  —  F~x  [u))2  du  and  Cramer-von  Mises 
type  statistics  is  given  as  follows.  F^fa)  can  be  written  as  F~~1( F”1)  where  T”1  is 
the  quantile  function  obtained  from  observations  from  uniform  distribution.  Note  that 
j F,“1(r“l(ti))  —  F~x{u)  (T~l(w)  —  v)/ f{F-~x{u))  and  —  u )  approaches  a 

Brownian  bridge  as  n1//2(r n(u)  —  u )  does.  See  Appendix  in  Shorack  (1972)  for  these 
results.  Therefore  fQ{F'^x{u)—F'~1(u))2du  can  be  understood  as  Cramer- von  Mises  statistic 
where  1  / f{F~x{u))2  is  taken  as  a  weight  function.  See  Gregory  (1977a, b).  Starting  from 
the  classical  Anderson  and  Darting  statistic  (1952)  various  authors  considerd  Cramer-von 
Mises  statistics  with  various  weight  functions.  See  de  Wet  and  Venter  (1973a, b),  Gregory 
(1977a, b),  Petti tt  (1977,1978).  In  particular,  Pettitt  (1977)  considers  the  components  of 
Cramer-von  Mises  statistic  with  the  weight  function  1/</>(<F~’1('u))2  where  (j)  =  <J>'  is  the 
standard  normal  density.  From  the  above  argument  it  is  not  surprising  that  there  are  close 
connections  between  his  results  and  our  results  in  Section  3.1. 


Appendix:  Computation  of  components. 


We  discuss  computational  procedures  used  to  obtain  components  in  Section  2.3. 


Let 


-  (-l)ne*2/2. 


(Al) 


Hn{x) 

Ln(x)  =  (— 1)>V 

Pn(x) 


dxr 


„  In/2]  ,  . 

-®! 2/2  _  (~U  n!  xn-2fe 


dx 7 


-E 


" f  2 kk\{n  —  2fc)! 
(— l)fc(n!)2 


„n  —  k 


0  k\{{n~k)\)2' 


1  dT 


’/2]  t  i \fc 


2 nn\  dxn 


(x*  - 1  r  =  E  = 


k—0 


(— l)fe(2n  —  2fc)!  2fc 

2  nkl(n  —  k)\(n  —  2  fc)! 


be  the  Hermite,  Laguerre,  and  Legendre  polynomials,  respectively.  Then  {  Hn(x)/y/n\  }^Lo> 
{  Ln(x)/n\  }^0,  {  Pn(2x  —  l)y/2n  +  1  }^L0,  are  complete  orthonormal  systems  of  L2-spaces 
associated  with  standard  normal,  standard  exponential,  and  uniform  distributions  respec¬ 
tively. 

For  symmetric  distributions  we  have  the  following  result. 


Appendix:  Computation  of  components. 


IS 


Proposition  Al.  Let  G  €  T2  be  symmetric  about  its  mean  pG.  Let  F,  ipk  be  either 
(i)  standard  normal  and  the  Hermite  polynomials,  or  (ii)  uniform  and  the  Legendre  poly¬ 
nomials,  suitably  normalized.  Then  a2k  =<  G~l,ip2k  °  F~l  >=  0,  k  —  1, 2, ... . 

This  follows  from  the  fact  that  ip2k  o  F~ 1  is  even  about  u  —  1/2.  For  some  cases 
the  following  lemma  is  useful. 

Lemma  Al.  Let  F,G  have  positive  densities  f,g  respectively.  Let  hn(x)f(x)  be  an 
indefinite  integral  of  ipn[x)f(x).  Suppose  that  limu_lj0  G~x  (u)f(F~1(u))hn(F~1  (u))  =  0. 
Then 

an==~I0  i(u))du- 

Proof  is  by  integration  by  parts. 

Now  we  discuss  each  of  6  cases  treated  in  Section  2.3. 

Case  l.  Uniform  with  respect  to  Normal.  It  is  easy  to  obtain  ay  —  1/(2^/tt).  Since 
G(u)  =  u  is  bounded  the  assumption  of  Lemma  Al  is  clearly  satisfied.  Hence  V n\an  = 
fo  J?n-i(<I>-1(u))<?>($-1(u))du.  Using  Hk{x)  =  xllk^[x)  -  {k  -  1  )Hk_2{x)  we  obtain  the 
following  recurrence  relation  V^\an  =  -[(n  -  2)/2]y[^2)!an_2.  This  with  ox  above 
gives  all  components  in  view  of  Proposition  Al. 

Case  2.  Exponential  with  respect  to  Normal.  Again  the  condition  of  Lemma  Al 
is  satisfied  and 

This  was  evaluated  by  using  Hermite- Gauss  quadrature  formula  with  20  points. 

Case  3.  Logistic  with  respect  to  Normal.  Given  the  components  of  exponential 
distribution,  the  components  of  logistic  distribution  can  be  obtained  immediately.  Let 
G  (u)  =  -  log(l  -  u)  be  the  quantile  function  of  the  exponential  distribution.  Then  the 
quantile  function  of  the  Logistic  distribution  can  be  written  as  (1/2)(C_1(«)  —  G_1(l  —  n)) 
corresponding  to  Gj1  in  (3.2).  Therefore  as  in  Section  3.1  we  see  that  the  odd  order 
components  of  logistic  distribution  are  the  same  as  those  of  exponential  distribution  up  to 
a  multiplicative  constant.  Even  order  components  vanish  by  Proposition  Al. 


u 


Cases  4  and  5.  Wcibull  with  respect  to  Exponential.  In  this  case  components 
can  be  explicitly  written  down.  Let  G(x )  =  1  —  exp(— xc)  be  the  distribution  function  of 
Weibull  distribution  with  parameter  c.  It  is  the  distribution  of  Yxlc  where  Y  is  a  standard 
exponential  random  variable.  Hence  G~1(F(y))  =  yl/c  and 


n!a„  =  /  G  i(F(y))Ln(y)e  vdy 
J  o 

poo 

=  /  yX/cLn{y)e~vdy 
Jo 

n 

—  ^2  CjT(l/c  +  n  —  i  +  1), 
i=0 

where  Ci’s  are  coefficients  of  the  Laquerre  polynomial  given  in  (Al). 


Case  6.  Beta(2,2)  with  respect  to  UniCorm.  Here  the  beta  distribution  is  taken  to 
be  on  [-1,1]  for  convenience.  Let  g(x)  —  (3/4)(l  —  x2).  Then  G{x)  =  (3/4)*  —  (l/4)a;3  + 1/2. 
Therefore 


1 

n  +  -aT 


f  G~l {u)Pn{2u  —  l)du 
Jo 

J  xPn(2G(x)  —  L)g(x)dx 
1 1  xFn^\x  ~ 


This  can  be  easily  evaluated  using  (Al). 
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20.  Abstract... 


Let  F  be  a  distribution  function  on  the  real  line  having  finite  vari¬ 
ance.  Then  the  quantile  function  F-1  belongs  to  L2[0,1].  Hence  F-1 
can  be  expanded  in  a  Fourier  series  with  respect  to  an  appropriate  complete 
orthonormal  system  of  L2[0,1]  •  When  F  is  taken  to  be  an  empirical  dis¬ 
tribution  r unction  Fq  ,  this  leads  to  a  decomposition  of  the  Shapiro-Francia 
test  statistic.  This  is  analogous  to  the  components  of  the  Cramer-von  Mises 
statistic  discussed  by  Durbin  et  al.  (1972,1975).  The  technique  is  applicable 
to  a  general  distribution  and  tests  of  exponential ity  and  uniformity  are 
discussed  in  addition  to  tests  of  normality. 
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